Some Experiments on the Use of One-channel Noise Reduction Techniques with the Italian Speechdat Car Database

Authors

  • M. Matassoni
  • M. Omologo
  • A. Santarelli
  • P. Svaizer
Abstract

This work investigates noise reduction techniques for hands-free speech recognition in the car environment. A set of experiments was carried out using different speech enhancement algorithms based on noise estimation. In particular, linear subtraction and MMSE estimators are considered in various configurations, each depending on a different set of parameters. Experiments were conducted on connected and isolated digits extracted from the Italian version of the SpeechDatCar database. Spectral subtraction with a suitable choice of the oversubtraction factor gave a relative improvement of more than 30%: digit recognition accuracy rose from 94.39% to 96.15%, that is, the error rate fell from 5.61% to 3.85%.
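The role of the oversubtraction factor can be illustrated with a minimal sketch of one common form of spectral subtraction, applied to the power spectrum of a single frame. This is not the authors' implementation: the function names, the spectral floor `beta`, and the default values of `alpha` and `beta` are assumptions chosen only for illustration, and the abstract does not state which values or which variant were used.

```python
from typing import List, Sequence


def estimate_noise_power(noise_frames: Sequence[Sequence[float]]) -> List[float]:
    """Average the power spectra of frames assumed to contain noise only."""
    n_frames = len(noise_frames)
    n_bins = len(noise_frames[0])
    return [sum(frame[k] for frame in noise_frames) / n_frames
            for k in range(n_bins)]


def spectral_subtract(power: Sequence[float],
                      noise_power: Sequence[float],
                      alpha: float = 2.0,   # oversubtraction factor (assumed value)
                      beta: float = 0.01    # spectral floor (assumed value)
                      ) -> List[float]:
    """Subtract alpha times the noise estimate from each frequency bin.

    Bins that would fall below beta times the noise estimate are clamped
    to that floor, which keeps the result non-negative.
    """
    return [max(p - alpha * n, beta * n) for p, n in zip(power, noise_power)]


# Two hypothetical noise-only frames with two frequency bins each.
noise = estimate_noise_power([[1.0, 2.0], [3.0, 2.0]])

# One hypothetical noisy-speech frame: the first bin is dominated by speech,
# the second by noise, so the second bin is clamped to the floor.
enhanced = spectral_subtract([10.0, 3.0], noise, alpha=2.0, beta=0.01)
```

With `alpha` greater than 1, more than the estimated noise is removed from every bin. This suppresses residual noise more strongly, at the cost of attenuating weak speech components, which is why the choice of this factor matters for recognition accuracy.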


Similar resources

Feature vector selection to improve ASR robustness in noisy conditions

It is well known that noise reduction schemes are beneficial in ASR to reduce training-test mismatch due to noise. However, a significant mismatch may still remain after noise reduction, especially in the non-speech portions of the signals. To reduce the impact of this mismatch, two methods for discarding non-speech acoustic vectors at recognition time are investigated: variable frame rate pro...


The speechdat-car multilingual speech databases for in-car applications: some first validation results

The main objective of SpeechDat-Car is to develop a set of speech databases to support training and testing of multilingual speech recognition applications in the car environment. SpeechDat-Car started in April 1998 in the 4th EC framework under project code LE4-8334. The duration of the project is 30 months. Equivalent and similar resources for nine languages will be created: Danish, English, ...


SPEECHDAT-CAR. A Large Speech Database for Automotive Environments

The aims of the SpeechDat-Car project are to develop a set of speech databases to support training and testing of multilingual speech recognition applications in the car environment. As a result, a total of ten (10) equivalent and similar resources will be created. The 10 languages are Danish, each language 600 sessions will be recorded (from at least 300 speakers) in seven characteristic envir...


SpeechDat-Car Fixed Platform

SpeechDat-Car aims to develop a set of speech databases to support training and testing of multilingual speech recognition applications in the car environment. The database comprises two types of recordings. The first type consists of wideband audio signals recorded directly in the car, while the second type is composed of GSM signals transmitted from the car and recorded simultaneously in a far-en...


Quantile based histogram equalization for online applications

The noise robustness of automatic speech recognition systems can be increased by transforming the signal so that the cumulative density functions of the signal's values in recognition match the ones that were estimated on the training data. This paper describes a real-time online algorithm to approximate the cumulative density functions, after Mel scaled filtering, using a small number of quan...




Publication date: 2001